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Preface 


The  purpose  of  this  study  was  to  compare  Minimum  Dis¬ 
tance  estimation  techniques  to  the  Method  of  Moments  and 
quasi-clustering  techniques  when  applied  to  mixed  exponen¬ 
tial  and  mixed  normal  distributions.  A  Monte  Carlo  simula¬ 
tion  was  performed,  using  the  Anderson-Darling  goodness-of- 
fit  statistic  as  a  distance  measure  for  the  Minimum  Distance 
method.  Finally,  the  mean-square  error  was  used  as  the  per¬ 
formance  criteria  in  the  comparison  step. 
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Gallagher  and  Capt.  Greg  McIntyre  for  their  invaluable  assist- 
tance  in  the  development  of  the  computer  software.  A  word  of 
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for  their  long-distance  words  of  encouragement  and  support. 
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Abstract 

4 

Minimum  Distance  estimation  was  used  to  calculate 
estimates  of  the  mixing  proportion  of  the  mixture  of  two 
normal  distributions  and  the  mixture  of  two  exponential 
distributions.  The  estimation  was  carried  out  by  using  the 
Golden  Search  technique  to  minimize  the  Anderson-Darling 
goodness-of-fit  statistic.  A  Monte  Carlo  simulation  was  run 
for  both  distribution  mixtures,  varying  the  mixing  propor¬ 
tions  from  .25,  .5  to  .75  with  sample  sizes  of  100  for  the 
normal  mixture  and  750  for  the  mixture  of  exponentials. 

The  simulation  was  run  500  times  for  each  parameter  combin¬ 
ation. 

An  ad  hoc  quasi-clustering  technique  was  used  to  obtain 
the  initial  estimates  for  the  parameters  of  the  mixed  normal 
while  the  method  of  moments  technique  was  used  to  obtain 
initial  estimates  for  the  mixed  exponential  parameters. 

These  estimates  were  then  used  to  start  the  minimum  distance 
routines  which  were  used  to  obtain  new  estimates  of  the  mix¬ 
ing  proportions. 

Finally,  the  mean  square  errors  were  calculated  for  use 
as  a  means  of  comparison  for  the  different  estimation  pro¬ 
cedures  v 

\ 

\ 
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MINIMUM  DISTANCE  ESTIMATION  OF 
A  MIXING  PROPORTION 


I .  Introduction 


In  recent  years,  there  has  been  an  increasing  interest 
in  system  reliability  by  the  military.  This  interest  has 
been  spurred  by  the  higher  cost  of  new  technology  weapon 
systems.  By  the  use  of  statistics,  a  system's  ability  to 
perform  a  given  mission  and  its  reliability  can  be  predicted. 
This  prediction  and  related  predictions  can  be  used  to  pro¬ 
cure  superior  systems  by  the  use  of  multi-criteria  decision 
analysis  or  as  an  aide  to  improving  reliability  through  en¬ 
gineering  design.  These  are  only  a  few  examples  of  the  many 
uses  of  reliability  theory  by  the  military,  however,  the 
first  step  in  the  process  is  reliability  parameter  estima¬ 
tion. 

Every  system's  time-to-failure  can  be  determined  by  a 
probability  density  function  (p.d.f.)  and  its  associated 
cumulative  distribution  function  (c.d.f.)  or  by  a  mixture 
of  p.d.f. 's  and  their  associated  c.d.f. 's.  Classically,  the 
functional  form  of  this  c.d.f.  is  hypothesized  by  one  of  the 
classical  distributions,  based  on  a  set  of  test  data.  After 
the  functional  form  has  been  hypothesized,  its  associated 
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parameters  must  be  estimated.  Prediction  of  system  failure 
rates  depends  largely  on  how  well  these  parameters  are  esti¬ 
mated  using  the  test  data.  Many  methods  have  been  used  for 
this  estimation,  for  example,  the  method  of  moments,  linear 
maps,  and  maximum  likelihood  estimation  (MLE).  All  of  these 
methods  are  widely  accepted  and  are  well  documented  for  com¬ 
mon  distributions  and  a  few  distribution  mixtures.  The  final 
step  in  the  process  is  to  perform  a  goodness-of-fit  test 
which  measures  the  distance  between  the  estimated  distribu¬ 
tion  and  the  hypothesized  sample  distribution  function. 

Recently,  a  new  technique  known  as  minimum  distance 
estimation,  has  been  developed  which,  in  a  sense,  "reverses" 
the  above  procedure.  The  functional  form  of  the  distribution 
is  still  hypothesized,  however,  the  parameters  of  the  p.d.f. 
are  chosen  so  as  to  minimize  the  goodness-of-fit  statistic. 
This  procedure  has  been  applied  to  most  of  the  classical 
distributions  with  good  success.  However,  the  process  has 
not  been  widely  used  when  investigating  mixed  distributions. 

The  objective  of  this  research  is  to  compare  the  initial 
parameter  estimates  obtained  from  the  method  of  moments  and 
quasi-clustering  techniques  to  those  obtained  by  the  use  of 
minimum  distance  estimation.  In  doing  this,  the  search  for 
a  better  estimation  technique  will  be  advanced. 
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II .  Estimation 


When  the  military  uses  a  probability  density  function 
to  approximate  or  describe  a  system's  reliability,  many  of 
the  parameters  associated  with  the  p.d.f.  are  unknown  and 
must  be  estimated.  There  are  several  widely  accepted 
methods  used  for  this  estimation,  such  as  the  Method  of 
Moments,  the  Method  of  Maximum  Likelihood,  the  Method  of 
Least  Squares  and  Bayesian  Estimation.  Of  these,  the  Method 
of  Moments  and  Maximum  Likelihood  have  been  the  most  widely 
used.  Recently,  a  new  method  has  come  under  investigation. 
This  new  method  is  known  as  Minimum  Distance  Estimation. 

With  all  of  these  different  methods  of  estimation  available 
the  question  of  which  method  provides  the  best  parameter 
estimates  arises. 

Criteria  of  Good  Estimators 

Basically,  there  are  four  criteria  that  are  used  either 
individually  or  collectively  to  define  good  estimators. 

These  criteria  are 

1.  Invariance  and  Unbiasedness 

2.  Consistency 

3 .  Efficiency 

4.  Minimum  risk  invariant  and  minimum  variance  unbiased 


It  is  up  to  the  user  to  decide  which  of  the  above 
criteria  is  most  important,  however,  the  estimation  methods 
listed  previously  all  produce  estimates  which  fit  at  least 
some  of  the  above  criteria.  Following  chapters  will  contain 
explanations  of  the  most  commonly  used  methods,  and  the  new¬ 
est  method,  when  applied  to  mixed  distributions. 
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III.  The  Mixed  Model 


As  was  stated  in  the  introduction,  any  system's  time- 
to-failure  can  be  determined  by  a  p.d.f.  and  its  correspond¬ 
ing  c.d.f.  or  by  some  mixture  of  p.d.f.'s  and  their  corres¬ 
ponding  c.d.f.'s.  In  fact,  quite  often  a  system  can  exper¬ 
ience  more  than  one  type  of  failure,  for  example,  fatique 
failure  or  a  sudden  electrical  failure.  When  this  is  true, 
instead  of  using  a  single  distribution  to  describe  the  time- 
to-failure,  the  mixed  model  is  applicable.  Actually,  many 
problems  involving  mixtures  arise  in  several  different 
fields  of  study  other  than  reliability  theory,  for  example, 
biology  (13)  and  remote  sensing  and  space  applications 
(4,32).  This  explains  the  ever  increasing  interest  in  para¬ 
meter  estimation  for  the  mixed  model. 

Mathematically,  a  k-fold  mixed  c.d.f.  is  defined  as 


Fx(x)  *  7.  p-tFxwx)  0<p<l  and  p, 

i=i  i=i 


I  p,  -i 


(3.1) 


and  a  k-fold  mixed  p.d.f.  is  defined  as 


fx(x) 


pifx1(x) 


where 


Fx^x)  -  the  ith  subpopulation  c.d.f. 
fxi(x)  -  the  ith  subpopulation  p.d.f. 

0  <  p  <  l  -  the  mixing  proportion  parameter 


(3.2) 
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It  has  been  common  practice  to  assume  that  the  distributions 
in  a  mixture  belong  to  the  same  family,  but  differ  in  para¬ 
meter  values. 

Previous  Research 

With  a  few  exceptions  most  of  the  previous  research  in¬ 
volving  estimation  of  the  parameters  of  mixed  distributions 
involves  the  Method  of  Moments,  Maximum  Likelihood  and 
Bayesian  techniques,  all  of  which  have  been  proven  to  be 
difficult.  Such  researchers  as  Hosmer(12,21,22) ,  Cohen(9), 
Tan  and  Chang(7),  Hasselblad(l6) ,  Hill (18),  Day (11),  Bowdon 
and  Dick(6),  and  James (25)  have  all  considered  the  problem 
of  estimating  the  parameters  in  mixed  normals.  These  are 
only  a  few  of  the  authors  of  extensive  literature  involving 
the  mixed  normal.  This  extensiveness  is  due  to  the  diverse 
applications  of  the  mixed  normal. 

The  second  most  widely  explored  mixture  is  that  of  the 
exponential.  Cohen(9),  Hill (18),  Hasselblad(17 ) ,  Rider (34) 
Boardman(5),  Ashour  and  Jones (2)  and  Hader  and  Mendenhall 
(15)  have  all  researched  the  area.  A  few  researchers  have 
extended  this  research  to  the  mixed  Weibull,  Falls (14)  and 
Rider (35)  using  the  method  of  moments  and  Kao (26)  using  a 
graphical  method,  for  example. 

In  all  of  this  research  nearly  all  of  the  authors  are 
in  agreement  about  the  difficulty  of  the  estimation  problem, 
due  to  the  complexity  of  the  equations.  In  the  following 
chapters,  the  Method  of  Moments,  the  Method  of  Maximum  Like- 


lihood  and  Minimum  Distance  estimation  will  be  briefly  de¬ 
veloped,  including  the  required  equations. 


IV.  Estimation  Techniques  and  Mixed  Distributions 


The  Method  of  Moments 

One  of  the  oldest  and  simplest  methods  for  deriving 
point  estimators  Is  the  Method  of  Moments.  Due  to  Its 
lack  of  sophistication,  this  method  has  its  faults.  For 
example,  the  estimators  are  usually  not  unbiased  or  the 
most  efficient.  However,  on  the  positive  side,  the  method 
is  relatively  easy  to  employ,  it  provides  consistent  estima¬ 
tors,  and  most  importantly,  it  often  provides  estimators 
when  other  methods  fail  to  do  so.  (30:360) 

The  kth  moment  about  the  origin  of  a  random  variable 
is  defined  as 


M’jt  -  E  <Yk) 


(4.1) 


and  can  be  found  by  taking  the  kth  derivative  of  the  cor¬ 
responding  moment  generating  function  if  it  exists,  and 
evaluating  it  at  t-0.  Mathematically,  we  can  write 


M'k 


dkm(t)/dtk 


t-0 


(4.2) 


The  corresponding  kth  sample  moment  is  the  average 


m'jj  •  l/n 


(4.3) 


* 


A 
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The  method  of  moments  assumes  that  these  sample  moments  will 
be  good  estimates  of  the  corresponding  population  moments 
and  that  the  needed  population  moments  exist.  Since  the 
population  moments  are  the  functions  of  the  population  para¬ 
meters,  the  corresponding  equations  can  be  equated  and  solved 
for  the  desired  parameters.  In  other  words,  "Choose  as  esti¬ 
mates  those  values  of  the  parameters  that  are  solutions  of 
the  equations  -  m'^,  k-1,2, . . .  ,t,  where  t  equals  the 

number  of  parameters . "  (30:357) 

Moments  of  the  Mixed  Normal 

In  this  research,  a  mixture  of  two  normally  distributed 
populations  Is  being  examined.  The  normal  cumulative  dis¬ 
tribution  Is  defined  as 

F(y)  (2s  02)~l/2  exp  -1/2  (y-M/*)2  (4.4) 

and  the  normal  probability  density  function  is  defined  as 

f(y)  -  (exp  -(y-  m  )2/2  *2)  (2#w)“1/2  (4.5) 

where  #>0,  is  the  standard  deviation  or  the  scale  parameter 

-  •  <  u  <  •  ,  is  the  mean  or  the  location  parameter 

-  •  <  y  <  • 

Therefore,  the  mixed  c.d.f.  will  be 
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F(x)  -  p £  (2  tr  #12"1/2  .xp  -i/2(y-  Ml/  *1)2  dy 

♦  (l-p)  (2  9  a  22)_1/2  exp  -l/2(y-M  2/  *  2>2  dV  (4.6) 

From  this  c.d.f.  it  can  be  seen  that  thera  ara  a  total  of 
fiva  paramatars  (  Mi»  M 2>  *i2,  a22,  P)  that  must  1)6  •■timated. 
Tharafora,  tha  first  fiva  momants  must  ba  darivad. 

Tha  moment  generating  function  for  the  normal  distri¬ 
bution  is 


exp  (  at  +  (t2  a2)/2 


(4.7) 


Taking  tha  first  five  derivatives  and  evaluating  at  t-0,  we 
gat  tha  following  momants: 

(  m  +  a  2t)exp(  Mt+(t2  a2)/2)  |t.Q  -m  (4.8) 

S2exp(  Mt+(t2  S2)/2)  +  (  M+ »2t)2exp(  Mt+(  ^2t2)/2)  |t.Q 

-  M  2+  a  2  (4.9) 

a2  [(  M  +  a 2t )exp(  Mt+(t2  a2)/2|  +  2(  M  ♦  a2t)  (  a2) 

axp(  Mt+(t2  a  2)/2)  ♦  (  M  +  a2t)3axp(  Mt+(t2  a2)/2|t.Q 

-  3  M  a2  +  *  3  (4.10) 

a2[a2axp(  m t+(t2  a2)/2)  +  (  m+ a2t)2exp(  Mt+(t2  a2)/2 

♦  2  a4exp<  m t-Mt2  a2)/2)  +2a2(M+a2t)2 
•*P(  m  t+(t2  a2)/2)  ♦  3(M+#2t)2(a2) 

axp(  Mt+(t2  a2)/2)  ♦  (  M+#2t)4axp(  j»t+(t2  a2)/2)  L.0 
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-  3*4  +  6*2M2  +  M4  (4*11) 

[2  (  ft  +  e  2t )  (  e  4 )  +  2(  >*+ S2t)  (2  M4  +  6  94(  fl+  92t) 

+  4(#i+*2t)3(ff2)J  exP(  M  t+(  t2  M  2 )  /  2 ) 

+  (  M+  *2t)exp(  Mt+(t2  *2)/2)  [*4+ff2(  M+  *2t)2 
+  2  *4+2  *2(  fi+  S2t)2  +  3*2(#A+M2t)2  +  (  M  +  ^2t)4||t-0 

-  M5  +  10  M3  ^2  +  15  M  (4.12) 

These  five  moments,  when  equated  to  their  corresponding 
sample  moments  and  rearranged  to  accommodate  the  mixture, 
yield  the  following  equations  which,  when  solved,  provide 
the  five  parameter  estimates. 

^XA/n  -  p  Mj  +  (1-p)  M2  (4.13) 

^Xi2/n  -  p(  Mi2+  Mi2)  +  (1-P)(  M22+  *22)  (4-14) 

^X13/n  -  p(  3  M  x  «x2  +  Mx3)  +  ( 1-p)  (  3  M  2  *22  +  M23)  (4-15) 

^Xt4/n  -  p(  3  #x4  +  6  #x2Mx2  +  Mx4)  +  ( 1-p)  ( 3  #24 

♦  6  *22  M22  *  M  23)  (4.16) 

Sx^/n  -  P(  M  I5  ♦  10  Mx3  #x2  ♦  15  Mi  Mi4)  ♦  (1-P)  (  M25 

♦  10  M23  #22  +  15  M 2  #24)  (4-l?) 

Moments  of  the  Mixed  Exponential 

The  second  distribution  that  will  be  used  in  this 
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research  is  the  exponential.  The  exponential  c.d.f.  and 
p.d.f.  are  respectively. 


F(t)  - 

l-exp(-t/8  ) 

t>0 

(4.18) 

f(t)  - 

(1/#  )exp(-t/» 

)  ,  t>0,  9  > 0 

(4.19) 

From  the 

above  we  get  the  mixed  c.d.f.  of 

F(x)  - 

p(l-exp(-x/  #1))  +  (l-p) (l-exp(-x/  #2)) 

(4.20) 

As  with  the  normal,  we  can  see  that  there  are  three  parame¬ 
ters  (  #1,  # 2,  p)  that  must  be  estimated.  This  requires 

the  first  three  moments  which  can  be  derived  from  the  expo¬ 
nential  moment  generating  function.  This  function  is 

Mx  (t)  -  (1-  $  t)*1 

(4.21) 

As  before,  we  take  the  first  three  derivatives  and 

at  t-0  to  yield  the  following. 

evaluate 

l. 

•  (l-  #tr2  |t.0  .  # 

(4.22) 

2. 

2  • 2 ( 1-  f t)"3 

t.O  •  2,2 

(4.23) 

3. 

<  9  3(1-  *  t)"4 

t-0  •  4,3 

(4.24) 
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One*  again,  equate  the  above  population  moments  to  their 
corresponding  sample  moments  and  rearranged  for  a  mixture 
to  obtain  the  required  equations. 


^xA/n  -  p  +  (l- 

■P)  •  2 

(4.25) 

^xt2/n  -  2p  #x2  + 

<1-P)2  t22 

(4.26) 

Xxi3/n  +  6p  #x3  + 

(l-p)6  t 23 

(4.27) 

The  Method  of  Maximum  Likelihood 

The  Method  of  Maximum  Likelihood  is  a  much  more  sophis 
ticated  method  of  estimation  than  the  method  of  moments. 

In  being  so,  it  is  more  difficult  to  apply,  however,  the 
the  method  is  usually  better  than  the  method  of  moments  and 
yields  asymptotically  best  estimators  which  explains  its 
popularity. 

The  method  begins  with  a  random  sample  from  a  probabll 
ity  density  function  with  unknown  parameters.  The  likeli¬ 
hood  function  is  now  defined  as  the  joint  density  function 
of  the  n  random  variables,  and  can  be  written  as, 

n 

L  "  iSl  fx  <*1'*  •  )  (4.28) 

where  #  is  an  unknown  parameter 

If  the  random  sample  is  from  a  multiparameter  density  func¬ 
tion,  then 


n 

L*  ^1;  ^  ]()  (4.29) 

The  maximum-likelihood  estimators  (MLE)  of  the  parameters 
are  the  values  of  the  parameters  which  maximize  L  or  equiva¬ 
lently  In  L.  Quite  often  these  values  are  the  solutions  to 
the  following  equations. 

d InL/ d  # i  -  0  i-1 ,2, . . . ,k  (4.30) 

(29:82) 

Maximum-Likelihood  Estimation  and  the  Mixed  Normal 

Maximum- likelihood  estimation  of  the  mixed  normal,  in 
its  long  history,  has  proven  to  be  very  difficult.  This 
difficulty  arises  largely  in  the  fact  that  the  likelihood 
equations  do  not  have  an  explicit  solution  and  instead, 
must  be  solved  by  iterative  methods. 

Given 

f^  -  frequency  of  the  observations  (4.31) 

<3ij  -  l/(2»  *j2)1/2  expf-fXi  -  Mj)2/ 2ffj2  (4.32) 
we  can  let 

$ 

Qi  "  <3ij  Pj  (4.33) 

so  that  the  log  of  the  likelihood  function  is  approximately 
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L  - 


(4.34) 


n 

Z  fi  LntQi) 

i»l 

If  we  take  the  partial  derivatives  and  set  them  equal  to 


zero  we  get  the  following 

solved  iteratively. 

set  of  equations  that 

must  be 

0  - 

d  L/  - 

n 

am 

P;)<Ji;j<xi  " 

M j  )/ o j2 

j *1 , 2 , . . . ,k  ( 

0  - 

<)  L/  ^  0  j  * 

n 

2  fi/Qi 
i«l  1  1 

n 

PiQij  (xt  - 

M j  )2/  9 j3 

j-1,2, - k  ( 

0  - 

d  L/  d  pj  * 

^ij'^ik) 

j  *1 / 2 , • • 

.  ,k-l  (4.37) 

Two  possible  methods  for  solving  these  equations  could  be 
Newton ' s  Method  or  the  method  of  Steepest  Descent . 
(16:432-436) 

Maximum  Likelihood  Estimation  and  the  Mixed  Exponential 
Similarly,  for  the  mixed  exponential  where, 

exp(-x/f  )  for  x>0 

Pjfj  (*) 

log  of  the  likelihood  function  as 
L  ■  A  In  g(x^)  (4.40) 


(4.38) 


(4.39) 


fj(X)  -  1/0 

and 

i 

9<«> 

we  can  write  the 


4.35) 

4.36) 
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By  taking  the  partial  derivatives,  equating  them  to  zero, 
and  simplifying  as  illustrated  by  Hasselblad  (1966),  we 
get  the  following  equations  which  must  be  solved  by 
"successive  substitution"  iteration.  (17:1462) 


Pi(v+i)  -  Pjv/N  Z,  f-j  (xk)/g(xi) 


,V/H  1 


(4.41) 


9 j  (v+1)  -  fj  (v)  (x^xj/g^  2  fj  (v)  (x±/g(v)  (xt 


)  (4.42) 


where  v+1  -  current  iteration 
v  *  last  iteration 


N 
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V.  Minimum  Distance  Estimation 

Minimum  Distance  estimation  is  a  relatively  new  estima¬ 
tion  technique  which  is  in  the  forefront  of  statistical 
research.  The  method  was  first  developed  by  J.  Wolfowitz  in 
a  series  of  articles  beginning  in  1952.(36,37)  In  these  arti¬ 
cles  he  developed  the  method  as  a  means  of  deriving  "super- 
consistent"  estimators  when  classical  methods  failed  to.  (36: 
1953).  During  the  following  years  much  work  was  done  in  the 
field  of  minimum  distance  applications  to  classical  distribu¬ 
tions.  Much  of  this  work  was  done  by  Dr.  Albert  Moore  and 
his  thesis  students.  For  example,  Hobbs,  Moore  and  James (19) 
investigated  minimum  distance  estimation  and  the  three  para¬ 
meter  Gamma  distribution  while  Hobbs,  Moore,  and  Miller (20) 
applied  the  method  to  the  three  parameter  Weibull.  Very 
recently  minimum  distance  estimation  has  been  under  study  for 
robustness.  Parr  and  Schucany(33)  were  two  of  the  first  :.o 
investigate  this  area  and  did  show  that  minimum  distance 
techniques  provide  robust  estimators  for  the  location  para¬ 
meters  of  symmetric  distributions.  Since  then  Daniels (10) 
applied  Parr's  methodology  to  the  generalized  t  distribution, 
and  Miller (31)  to  the  three  parameter  Weibull,  among  others. 

Wolfowitz 's  minimum  distance  method  begins  by  defining 
the  distance  between  two  distribution  functions  as 


<Fl>F2)  - 


sup 

X 


Fl(x)  -  F2(X) 


(5.1) 
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where  sup  means  supremum. 


The  method  is  in  no  way  tied  to  this  definition  and  can  be 
applied  with  many  different  definitions  of  distance.  (36:10) 
For  the  purpose  of  this  research  however,  the  above  definition 
of  distance  will  be  used. 

The  method  next  selects  a  random  sample  of  size  n 
from  a  known  distribution  and  determines  the  empirical  dis¬ 
tribution,  Sn(x).  The  empirical  distibution  is  defined  as 


Sn(x)  *  number  of  values  <  x  /  total  number  in  sample  (n)  (5.2) 


and  is  a  non-decreasing  step  function  with  jumps  of  size  l/n 
at  each  observed  value.  (3:269) 

The  final  step  is  to  determine  the  parameter  estimates 
by  minimizing  the  calculated  distances  between  the  empirical 
distribution  function  and  the  true  or  hypothesized  distribu¬ 
tion.  There  are  several  different  distance  measures  that 
are  commonly  used  for  this  step,  including  the  Kolmogorov- 
Smirnov,  the  Cramer-von  Mises,  and  the  Anderson-Darling 
statistics . 


Kolmogorov-Smirnov 

The  Kolmogorov-Smirnov  statistic  is  defined  as 


D  -  sup  F( x )  -  Sn(x) 


(5.3) 
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In  other  words,  the  largest  absolute  deviation  between  the 
hypothesized  distribution  and  the  empirical  distibution. 
Computationally , 


D  -  max(D+,  D- ) 

(5.4) 

where  D+  and  D“  are  defined  as  follows 

D+  =  max((i/n)  -  z^),  1  <i<  n 

(5.5) 

D-  =  maxfzj^  -  (i-l)/n),  1  <i<  n 

(5.6) 

and 

z±  =  Ffx^ 

(5.7) 

(3:269-270) 


Cramer-von  Mises 

The  Cramer-von  Mises  distance  technique  is  defined  by 

W2  -  n/'C°  [sn(x)-F(x)|2  *(F(x))dF(x)  (5.8) 

“CO 

where  (F(x))  is  a  weighing  factor  or  computationally 

n 

W2  -  [l/(12n)|  +5^  [zi  -  (2i-l)/2n|2  (5.9) 


where  z^  is  defined  as  before.  (24:17) 


Anderson-Darling 

The  Anderson-Darling  distance  measure  is  the  measure 
used  for  this  research.  It  is  a  special  case  of  the  Cramer- 
von  Mises  distance  measure  where 


*(F(x))  -  1/F(x) (l-F(x) ) 


(5.10) 


This  weighting  factor  causes  more  emphasis  to  be  placed  upon 
the  tails  of  the  distribution.  (24:18)  The  statistic  can 
be  computed  by 


l2  -[-  (£  (2i-l)(ln(Zi)  +  lna-z^.jnj/n]  -  n  (5.11) 


where  Zj^  is  defined  as  before.  ( 1 : 765 )  For  the  mixed  normal 

A2  -  -[£  (2i-1)(ln(p£X,(21r  a  12)-1/2exp[-i/2(y-  /i^/  <r1)2]dy 

+  (l-p)^*'  (2  X  a  22)~1/2exp[-l/2(y- //2/  *2)2]d y) 

+  ln(l-(p£tn+1"‘(2  x  x2 ) -1/2exp  J-l/2 (y- // x/  <r^)2]  dy 
+  (l-p£XlH1''(2»  <r22)_1/2exp[-l/2(y-  //2/  <r2)2]  dy)  )j /n 


(5.12) 


and  for  the  mixed  exponential 

A2  -  -[2(21-1) (ln(p(l-exp(-xi/  #x)  +  (l-p) (l-exp(-xi/  *2)) 

+  ln(l-(p(  1-exp  (-Xj^.j/  *1))  +  (l-p)  (l-expf-Xj^.i/  0  2 }  >  >  >] 

/nl -n  (5.13) 
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These  are  the  equations  which  will  be  minimized  in  this 
research  to  obtain  the  minimum  distance  estimate  of  p  where 
all  other  parameters  have  been  previously  estimated  and  are 
assumed  true. 
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The  methodology  developed  in  this  thesis  assumes  that 
the  underlying  distributions  are  mixtures  of  distributions 
from  the  families  of  the  Normal  distributions  and  the  One- 
Parameter  Exponential  distributions,  as  defined  in  equations 
4.6  and  4.20  respectively.  In  1984,  Woodward,  Parr  and 
associates  applied  Minimum  Distance  techniques  to  the  mixed 
normal,  the  mixed  double  exponential,  and  the  mixed  t-distri- 
bution.(27)  In  Parr's  research,  the  distance  measure  chosen 
was  the  Cramer-von  Mises  goodness-of-fit  statistic  which 
differs  from  the  current  research.  However,  this  research 
is  an  extention  of  Parr's  study. 


Monte  Carlo  Simulation 

Monte  Carlo  simulation  techniques  have  been  widely  used 
in  previous  parameter  estimation  studies.  The  technique 
allows  you  to  specify  a  particular  underlying  distribution 
from  which  to  draw  the  sample  and,  in  doing  so,  allows  the 
performance  of  the  estimators  to  be  checked.  The  Monte 
Carlo  simulation  applied  to  this  research  consists  of  the 
following  steps. 

(1)  Generate  the  data  from  the  specified  distributions. 

(2)  Determine  the  initial  parameter  estimates  either 
by  using  the  Method  of  Moments  or  quasi-clustering 
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techniques . 


(3)  Determine  the  Minimum  Distance  estimates  of  p,  using 
the  estimates  of  step  two  as  starting  values. 

(4)  Compare  the  performance  of  the  estimators. 

Each  of  these  steps  will  be  explained  in  detail  in  the 
remainder  of  this  chapter.  At  this  point,  it  should  be  sta¬ 
ted  that  the  computer  time  required  to  accomplish  this  simu¬ 
lation  was  very  great.  This  limited  the  number  of  cases 
that  could  be  investigated. 

Sample  Generation  from  the  Mixed  Normal 

Six  samples  from  the  mixed  normal  distribution  were 
generated  using  the  method  described  by  Parr  and  associates 
(27).  The  first  step  was  to  generate  100  standard  normal 
deviates  by  use  of  the  IMSL  subroutine  GGNPM(23).  The  second 
step  in  the  process  was  to  determine  which  component  of  the 
mixture  the  deviate  would  be  assigned  to.  This  was  done  by 
first  selecting  the  desired  mixing  proportion,  then  genera¬ 
ting  100  (0,1)  uniform  deviates  from  the  IMSL  subroutine 
GGUBS  (23).  The  normal  deviates  were  assigned  to  the  first 
component  if  the  uniform  deviate  was  less  than  the  mixing 
proportion,  p,  and  to  the  second  if  it  was  greater  than  p. 

The  final  step  was  to  scale  the  normal  deviates  according  to 
the  assigned  component.  An  observation  from  f^x)  -  Y  where 
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Y  was  tha  standard  normal  davlate,  and  an  cbsarvation  from 
f2(x)  •  Y+b  where  b>0  and  is  dafined  as  tha  diffaranca  in 
location  paramatars.  Tha  paramatar  combinations  usad  war a 


P-.75 

o 

o 

a 

r-A 

5t 

n 2*2 . 32 

*1"  *2ml 

P-.75 

^1*0.0 

fi 2“  3 .6 

*lm  *2ml 

P-.50 

V-L-0.0 

fl 2*2 .56 

*1-  *2ml 

P-.50 

Hi-0.0 

^2*3.76 

ai-  *2ml 

P-.25 

^1*0.0 

H 2*2 . 32 

*1"  a2ml 

P-.25 

/i  2  *  3 .6 

*1-  *2ml 

These  combinations  were  selected  so  that  the  analysis  of 
this  study  could  be  compared  to  that  of  Parr  (27). 

Sample  Generation  from  the  Mixed  Exponential 

Sample  generation  for  the  two  component  mixture  of  one 
parameter  exponentials  was  much  simpler  than  that  of  the 
mixed  nornal .  Nine  samples  of  size  750  were  generated  using 
tha  IMSL  subroutine  GGEXT(23).  This  subroutine  automatically 
generates  deviates  from  a  specified  mixture  and  therefore 
requires  no  manipulation.  The  generated  samples  had  the  fol¬ 
lowing  paramatar  combinations. 
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P-.75 


•  i-2.0 


•  2-0.5 


P-.50 

#1-2.0 

#2*0 . 5 

P-.25 

#1-2 . 0 

#2*0.5 

P-.75 

•i-1.0 

#2*0.5 

P-.50 

#1*1  •  0 

#2*0.5 

p- .  25 

#  ^-1 .0 

#2*0.5 

P-.75 

#2*0.5 

P-.50 

•i-3.0 

#2*0.5 

P-.25 

•i-3.0 

#2-0.5 

These  parameter  values  were  chosen  to  give  varying  degrees 
of  separation  between  the  two  components. 

After  the  samples  were  generated  the  initial  parameter 
estimates  had  to  be  determined.  This  turned  out  to  be  a 
very  difficult  task. 

initial  Parameter  Estimates  for  the  Normal  Mixture 

Due  to  the  complexity  of  both  the  Method  of  Moments 
equations (4. 13-4. 17)  and  the  Maximum  Likelihood  equations 
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(4.35-4.37),  the  solutions  to  these  equations  are  both  dif¬ 
ficult  and  time  consuming  to  obtain.  Because  of  this  an 
alternate  method  was  chosen,  in  woodward  and  Parr's  paper 
(27),  they  obtained  starting  values  for  the  ML  technique  from 
an  easy  to  implement  quasi-clustering  technique,  in  their 
final  analysis  these  initial  estimates  were  found  to  be  high- 
f  ly  competitive  with  the  maximum  likelihood  estimates.  For 

this  reason  and  because  of  its  simplicity,  this  technique  was 
selected  for  use  in  this  research  and  it  is  explained  below. 

First  1*2  is  assumed  to  be  true.  Next,  the  initial 

estimate  of  the  mixing  proportion,  p0,  is  allowed  to  take  on 
the  values  of  .l,.2,...,.9  only.  The  sample  is  then  ordered 
and  divided  into  two  subsamples,  Y1,Y2,...Yn  and  Yn+l,  •••Yn, 
for  each  allowed  p0  value,  where  n^^  ■  np  rounded  to  the 
nearest  integer.  The  initial  estimate  of  p  is  the  value  p0 
that  maximizes 

p(l-p)  (ir^-mj)2  (6.1) 

where  nuj  is  the  sample  median  of  the  jth  subsample. 

The  choice  for  the  estimates  of  n ^  and  //2  are  the  corres¬ 
ponding  m^  and  m2  values  of  the  above  p  estimate.  Finally, 
the  estimates  of  *22  and  o±2  are  determined  by  the 
following  equations. 

*12(0)  "  ( ,25> )  /  . 6745 ) 2  (6.2) 
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(6.3) 


#22(0)  *  ( <r2( *75)'m2)  /  -6745)2 

where  rj^  Is  the  qth  quartile  from  the  jth  cluster. 

Initial  Parameter  Estimates  for  the  Exponential  Mixture 

The  Method  of  Moments  technique  was  chosen  to  obtain 
the  starting  estimates  for  the  case  of  the  mixed  exponential. 

The  moment  equations  were  defined  in  Chapter  4,  equations 
(4.25,4.26,4.27).  The  first  attempt  to  solve  these  equations 
was  made  using  the  method  of  false  position.  These  attempts 
failed  however.  Instead  of  converging  they  diverged  and 
therefore  a  new  method  was  searched  for.  The  method  selected 
was  that  of  Paul  Rider (34). 

Rider  suggested  combining  the  three  moment  equations. 

In  doing  so  and  then  simplifying,  the  following  quadradic  is 
obtained 

6(2m,12-m,2)  0j*2  +  2  (m'  3-3m'  ^m'  2 )  0-j*  +  3m'  22-2m’  1m'  3  -  0  (6.4) 

where  m'^  is  the  ith  sample  moment.  (34:144) 

The  roots  this  quadratic  are  estimates  of  and  02  and  in 
this  research  they  were  obtained  by  the  use  of  the  IMSL  sub¬ 
routine  ZQADR.(23)  With  these  two  estimates  an  estimate  of 
p  can  be  obtained  from 

p*  -  (m^  -  «2*)/(  9i*  -  «2*)  (6.5) 
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Minimum  Distance  Estimates 


After  obtaining  the  initial  estimates,  the  minimum  dis¬ 
tance  estimates  of  the  mixing  proportion  were  found  for  both 
the  normal  and  the  exponential  mixture.  Refer  to  Chapter  5 
for  the  explanation  of  minimum  distance  methods. 

The  distance  measure  selected  for  this  research  was  the 
Anderson-Darling  goodness-of-fit  statistic.  In  other  words, 
the  parameter  estimates  were  selected  so  as  to  minimize  this 
statistic.  The  method  used  to  perform  this  minimization  was 
the  Golden  Search.  The  following  is  a  general  algorithm  for 
the  method . ( 28 ) 

Assumption:  f(x)  is  unimodal  on  [a,b]  and  r  -.618034 

Step  0.  Let  x«a  and  x-b 
xL-*-rf*-x) 
xR-x+r (x-x) 

Step  l.  Evaluate  f(xL)  and  f(xR) 

Step  2.  f ( x^ )  a  f  (xR)£>x-xR  ;  xR»x^ 
xL  -  x-r(x-x) 

f(xL)  «  f(xR)S>X"*L  ;  xL-xR 
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xR  -  x  +  r(x-x) 

L  -  x-x 

Step  3.  If  L< €  ,  stop  x  $x*i  x 
Otherwise,  go  to  step  1. 

For  this  research  f(x),  the  Anderson-Darling  statistic,  is 
defined  in  equation  5.12  for  the  mixed  normal  and  in  equation 
5.13  for  the  mixed  exponential.  €  was  chosen  to  be  .0001, 
and  the  lntial  values  of  a  and  b  are  D  and  1  respectively, 
since  0<p<l . 

In  this  research,  when  evaluating  f(x)  the  c.d.f.  for 
the  appropriate  distribution  must  also  be  evaluated.  In 
the  case  of  the  mixed  normal  however,  the  c.d.f.  must  be 
approximated.  This  approximation  was  performed  by  the  IMSL 
subroutine  MDNOR . ( 23 ) 


Comparison  Criteria 

After  both  parameter  estimations  have  been  found,  the 
process  is  repeated  500  times.  This  is  done  for  both  the 
mixed  normal  and  the  mixed  exponential.  Using  the  estimates 
from  the  500  replications,  the  mean  square  error  for  both 
mixing  proportion  estimates  was  calculated  by 


MSE  -  1/500  >  (Pi~P)' 


(6.6) 
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A  comparison  of  the  two  different  MSE  calculations  gave  the 
indication  of  which  estimation  was  best. 

A  sample  of  the  computer  code  used  to  implement  the 
above  simulation  for  the  mixed  exponential  can  be  found  in 
the  appendix.  It  should  be  noted  once  again  that  computer 
time  needed  for  the  program  was  very  large  when  run  on  AFIT's 
SSC  computer  system. 
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VI.  Conclusions  and  Recommendations 


Summary 

This  thesis  compares  the  Method  of  Moments  and  quasi¬ 
clustering  estimation  techniques  to  Minimum  Distance  tech¬ 
niques  when  used  to  estimate  the  mixing  proportions  of  mixed 
one  parameter  exponentials  and  mixed  normals,  respectively. 

The  Method  of  Moments  estimates  were  used  as  the  baseline 
for  the  mixed  exponentials,  while  the  quasi-clustering  esti¬ 
mates  were  the  baseline  for  the  mixed  normals .  This  compar¬ 
ison  was  carried  out  by  a  computer  routine  which  performed 
a  Monte  Carlo  simulation,  then  calculated  the  mean  square 
errors.  These  errors  were  used  as  the  criteria  for  comparison. 

Conclusions 

At  the  beginning  of  this  research  it  was  hoped  that  the 
minimum  distance  estimation  technique  would  provide  better 
estimates  than  the  baseline  techniques.  In  the  case  of  the 
mixed  exponential  this  does  not  appear  to  be  true  as  can  be 
seen  in  table  I.  However,  the  small  mean  square  errors  show 
both  estimators  to  be  good  and  despite  the  fact  that  the 
Method  of  Moments  technique  appears  better,  it  has  problems. 

First  of  all,  the  method  of  moments  technique  requires 
very  large  sample  sizes  in  order  to  ensure  that  the  roots  of 
equation  6.4  are  positive.  32: 144)  Even  in  the  case  of  the 
sample  size  of  500  used  for  this  research,  one  of  the  para- 
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meter  combinations  still  became  infeasible.  (Denoted  on 
table  1  by  *).  On  the  other  hand,  the  minimum  distance  tech¬ 
nique  will  converge  in  relatively  few  iterations  with  any 
decent  initial  estimates,  ie.  educated  guesses,  graphical 
estimates,  etc.  Secondly,  the  method  is  much  more  difficult 
to  apply  than  the  minimum  distance  routine  due  to  the  compli¬ 
cated  moment  calculations.  For  this  reason,  and  due  to  the 
closeness  of  the  mean  square  errors,  the  method  of  minimum 
distance  is  recommended. 

In  the  case  of  the  mixed  normal,  the  results  were  much 
different.  Parr's  analysis,  which  used  the  Cramer-von  Mises 
statistic,  had  results  indicating  similar  mean  square  errors 
for  the  minimum  distance  and  the  clustering  estimation  tech¬ 
niques.  In  the  current  research  however,  the  clustering 
method  proved  to  be  superior.  This  was  due  to  the  inability 
of  the  Anderson-Darling  statistic  to  distinguish  between 
the  .75  and  the  .25  mixing  proportions.  For  this  reason, 
use  of  the  Anderson-Darling  statistic  as  a  distance  measure 
for  the  minimum  distance  technique,  when  applied  to  mixed 
normals,  is  not  recommended. 

Suggested  Research 

Extending  the  above  research  to  include  mixed  Weibulls 
is  suggested.  Due  to  the  strength  and  versatility  of  the 
Weibull  distribution  as  a  failure  model,  this  extension 
could  be  a  powerful  aid  in  the  search  for  better  parameter 
estimation  techniques. 
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TABLE  I 


Simulation  Results  for  Mixtures  of 
Two  Exponential  Components 


True  Mean 

Mixing 

Estimator 

MSE 

Values 

Proportion 

1  =  2.0 

.75 

MDE 

.0428782 

2  =  0.5 

Moment 

.0339208 

.50 

MDE 

.0281973 

Moment 

.0208788 

.25 

MDE 

.0159149 

Moment 

.0134047 

1  =  1.0 

.75 

MDE 

2  =  0.5 

Moment 

* 

.50 

MDE 

.0910827 

Moment 

.0862087 

.25 

MDE 

.105535 

Moment 

.101567 

1  -  3.0 

.75 

MDE 

.0381142 

2  -  0.5 

Moment 

.0235877 

.50 

MDE 

.0265675 

Moment 

.0136226 

.25 

MDE 

.011018 

i 

1 

Moment 

.00732790 

TABLE  II. 


Simulation  Results  for  Mixtures  of 
Two  Normal  Components 
(equal  shape  parameters) 


True  Mean 

Values 

Mixing 

Proportion 

Estimator 

MSE 

1  3  o.o 

2  =  2.32 

.75 

MDE 

Clustering 

.122337 

.0183400 

L  =  0.0 

2  =  2.56 

.50 

MDE 

Clustering 

.0338620 

.0114600 

1  =  0.0 

2  =  2-32 

.25 

MDE 

Clustering 

.332602 

.0194600 

1  =  0.0 

2  =  3.6 

.75 

MDE 

Clustering 

.153062 

.00726004 

x  =  0.0 

2  -  3.76 

.50 

MDE 

Clustering 

.0221589 

.00671999 

!  -  0.0 

2  -  3-6 

.25 

MDE 

Clustering 

.309898 

.00826004 

APPENDIX:  COMPUTER  CODE 

c  ****************************************************** 

c 

c  This  program  generates  estimates  of  the  mixing  pro- 

c  portion  fo  two  exponential  distributions  using  the 

c  both  the  Method  of  Moments  and  the  Minimum  Distance 

c  estimation  techniques, 

c 

c  ****************************************************** 

real  samp(750) ,true(3) ,temp,min(3) ,mse,a,b,c 

real  sammom( 3 ) , meanl,mean2,estip,x,y,z,msemom, tempt ( 3 ) 

real  xl,yl,zl 

integer  nr,n,m,d,ier 

complex  zsm,zlg 

common  true(3) 

double  precision  dseed 

external  ggext 

nr-750 

true(l)=  2.0 
true(2)=*  .50 
true(3)»  .25 
mse-0 .0 
msemom-0 . 0 
print* , '  ' 

print* , ' exponential :  meanl=2 . 0 ,mean2= . 50 ,p= . 25 ' 
print*,' 

dseed-295847397.d0 

Q  ******************************************** ********** 

C 

c  Begin  the  iteration  loop 

c 

Q  ****************************************************** 

do  10  k-l,500 
sammom(l)-0.0 
sammom(2)-0.0 
sammom(3)-0.0 
P-.25 

true ( 1 ) »2 . 0 
true(2)-.50 
true( 3)-p 
xml-2 
xm2- . 5 
nr-750 

C 

c  Generate  deviates  from  a  mixture  of  two  exponentials 

c  by  the  use  of  the  IMSL  routine  GGTEXT  and  sort  the 
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o  n 


data. 

****************************************************** 
call  ggtext ( dseed , p , xml , xm2 , nr , samp , ier ) 
print*,'  ' 
do  500  i-1,749 
d-i+1 

do  400  j  »d , 750 

if  (samp(i)  .gt.  samp(j))  then 
temp-samp ( i ) 
samp( i ) -samp( j ) 
samp( j )-temp 

end  if 

400  continue 

continue 

****************************************************** 

Calculate  the  first  three  sample  moments. 
****************************************************** 

do  600  i-1,750 

sammom( l )-sammom( l )+samp( i ) 
sammom( 2 ) -sammomj  2 ) +samp ( i ) **2 
sammom( 3 )-sammom( 3 )+samp( i) **3 
600  continue 

sammom(l )-sammom(l ) /750 
sammom( 2 ) -sammom( 2 ) /750 
sammom( 3 ) *sammom( 3 ) /750 

Q  ****************************************************** 

C 

c  Find  the  moment  estimators  by  use  of  the  IMSL  rou- 

c  tine  ZQADR.  This  routine  finds  the  roots  of  a  qua- 

c  dratic. 

c 

Q  ****************************************************** 

print*,'  ' 

nr-750 

m-3 

n-2 

xl-sammom( 1 ) 
yl-sammom( 2 ) 

2l-sammom( 3 ) 

x-xl 

y-yl 

z-zl 

a-6* ( 2*x**2-y ) 


500 

c 

c 

c 

c 

c 
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o  o 


b-2* ( z-3*x*y ) 


c-3*y**2-2*x*z 

call  zqadr(a,b,c,zsm,zlg, ier ) 

print* , '  sammom  check '  ,  ( sammom (  i ) ,  i-1 , 3  ) 

meanl-real ( zlg) 

mean2-real(zsm) 

estip-  ( sanunom  ( 1 )  -mean2 )  /meanl-mean2 ) 

print*, 'method  of  moments  estimates ' ,meanl ,mean2, estip 

q  ****************************************************** 

C 

c  Calculate  the  minimum  distance  estimates  by  using 

c  the  Golden  Search  method  to  minimize  the  Anderson- 

c  Darling  Goodness-of-fit  statistic, 

c 

Q  ****************************************************** 

tempt(l)-true(l) 

tempt ( 2 ) -true ( 2 ) 

tempt ( 3 ) -true ( 3 ) 

true ( 1 )-meanl 

true { 2 ) -mean2 

true( 3 )-estip 

call  mindis(nr, samp, m, min) 

print*,'  ' 

print* , 'minimum  distance  estimates' 
print* , ' iteration  ' ,k 
print* , (min( i ) , i-1 , 3 ) 

Q  ****************************************************** 

c 

c  Calculate  the  mean  square  error  for  the  minimum  dis- 

c  tance  and  the  method  of  moments  estimators . 

c 

Q  ****************************************************** 

mse-mse+(min( 3)-p)**2 
msemom-msemom+ ( estip-p ) **2 
10  continue 

mse-mse/500 

msemom-msemom/500 

print* , '  ' 

print*, 'mse-  ' ,mse 
print* , '  1 

print*, 'msemom-  ' ,msemom 

stop 

end 


****************************************************** 


37 


Subroutine  gof.  Calculates  the  Anderson-Darling  gof 
statistic  for  the  Golden  Search  routine. 


subroutine  gof (nr , samp, m, adgof) 

real  samp ( 750 ) , true( 3 ) , adgof ,cuml ( 750 ) ,cum2 ( 750 ) , z ( 750 ) 

real  sum .temp 

common  true(3) 

integer  nr,m,i 

nr-750 

do  10  i-l,nr 

cuml ( i ) -l-exp( -l*samp( i ) /true ( 1 ) ) 
cum2 ( i ) -1-exp ( -l*samp( i) /true ( 2 ) ) 
z ( i ) -true( 3 )*cumi ( i )+ ( l-true( 3 ) ) *cum2 ( i) 
if  (z(i) .It. 0.001) then 
z ( i ) -0 . 001 
end  if 

if  (z( i) .gt.O .000 ) then 
z  (  i )  • . 999 
end  if 
continue 
sum-0.0 
do  20  i-l,nr 

temp-(2*i-l)*(alog(z(i) )+alog(i-z(nr+l-i) ) ) 
sum- sum+ temp 
continue 

adgof- ( (-i*sum)/nr)-nr 

return 

end 


****************************************************** 

Subroutine  mindis.  Calculates  the  minimum  distance 
estimates  by  means  of  a  golden  search 

* ***************************************************** 
subroutine  mindis (nr , samp, m, min) 

real  samp(750)  ,true(3)  ,min(3,)  ,nr,m, adgof ^a.fb/Step^.a 
real  b, left, right, f left, fright 
integer  i 
common  true(3) 

step-. 01 
r- .618034 
a-0.0 

true(3)-0.0 

call  gof (nr, samp, m, adgof) 
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fa-adgof 

b-l.O 

true( 3)-left 

call  gof  (nr/Samp^.adgof ) 

fleft-adgof 

true( 3 ) -right 

call  gof (nr ,samp,m,adgof ) 

fright-adgof 

i-0 

continue 

if( fleft.lt. fright)  then 
b-right 
fb- fright 
right-left 
fright-fleft 
left-b-r* (b-a) 
true( 3 )-left 

call  gof (nr,samp,m,adgof ) 
fleft-adgof 

else 

a-left 
fa-fleft 
left-right 
fleft-fright 
right-a+r*(b-a) 
true( 3 )-right 
call  gof (nr , samp,m,adgof ) 
fright-adgof 
end  if 
i-i+l 

if (abs(fb-fa) .gt. . 0001. and. i. It. 200) then 
go  to  40 

else 

go  to  50 
end  if 
continue 

min(l)-true(l) 
min(2)-true(2) 
min( 3 )-true( 3 ) 
print*,'  ' 
print* , ' adgof ' 
print* ,adgof 
return 
end 
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